Assessing bias in experiment design for large scale mass spectrometry-based quantitative proteomics.

نویسندگان

  • Amol Prakash
  • Brian Piening
  • Jeff Whiteaker
  • Heidi Zhang
  • Scott A Shaffer
  • Daniel Martin
  • Laura Hohmann
  • Kelly Cooke
  • James M Olson
  • Stacey Hansen
  • Mark R Flory
  • Hookeun Lee
  • Julian Watts
  • David R Goodlett
  • Ruedi Aebersold
  • Amanda Paulovich
  • Benno Schwikowski
چکیده

Mass spectrometry-based proteomics holds great promise as a discovery tool for biomarker candidates in the early detection of diseases. Recently much emphasis has been placed upon producing highly reliable data for quantitative profiling for which highly reproducible methodologies are indispensable. The main problems that affect experimental reproducibility stem from variations introduced by sample collection, preparation, and storage protocols and LC-MS settings and conditions. On the basis of a formally precise and quantitative definition of similarity between LC-MS experiments, we have developed Chaorder, a fully automatic software tool that can assess experimental reproducibility of sets of large scale LC-MS experiments. By visualizing the similarity relationships within a set of experiments, this tool can form the basis of systematic quality control and thus help assess the comparability of mass spectrometry data over time, across different laboratories, and between instruments. Applying Chaorder to data from multiple laboratories and a range of instruments, experimental protocols, and sample complexities revealed biases introduced by the sample processing steps, experimental protocols, and instrument choices. Moreover we show that reducing bias by correcting for just a few steps, for example randomizing the run order, does not provide much gain in statistical power for biomarker discovery.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assessing Bias in Experiment Design for Large Scale Mass Spectrometry-based Quantitative Proteomics*□S

Mass spectrometry-based proteomics holds great promise as a discovery tool for biomarker candidates in the early detection of diseases. Recently much emphasis has been placed upon producing highly reliable data for quantitative profiling for which highly reproducible methodologies are indispensable. The main problems that affect experimental reproducibility stem from variations introduced by sa...

متن کامل

Detecting differential protein expression in large-scale population proteomics

MOTIVATION Mass spectrometry (MS)-based high-throughput quantitative proteomics shows great potential in large-scale clinical biomarker studies, identifying and quantifying thousands of proteins in biological samples. However, there are unique challenges in analyzing the quantitative proteomics data. One issue is that the quantification of a given peptide is often missing in a subset of the exp...

متن کامل

Computational and Statistical Methods for Protein Quantification by Mass Spectrometry

The definitive introduction to data analysis in quantitative proteomicsThis book provides all the necessary knowledge about mass spectrometry based proteomics methods and computational and statistical approaches to pursue the planning, design and analysis of quantitative proteomics experiments. The authorвЂTMs carefully constructed approach allows readers to easily make the transition into the ...

متن کامل

Updating JPROT's publication standards for large-scale proteomic studies: towards hypothesis-driven interpretation of predictive biological models.

The development in the 1990s of biological mass spectrometry into a robust analytical tool heralded a paradigm shift in biological research. Advances in instrumentation and methodologies have since fueled an expansion of the scope of biologicalmass spectrometry, from the simple analysis of single proteins to the characterization of highly complex proteomes. The cornerstone of proteomics analysi...

متن کامل

Machine learning methods for predictive proteomics

The search for predictive biomarkers of disease from high-throughput mass spectrometry (MS) data requires a complex analysis path. Preprocessing and machine-learning modules are pipelined, starting from raw spectra, to set up a predictive classifier based on a shortlist of candidate features. As a machine-learning problem, proteomic profiling on MS data needs caution like the microarray case. T...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Molecular & cellular proteomics : MCP

دوره 6 10  شماره 

صفحات  -

تاریخ انتشار 2007